ADO.NET Entity Framework

ADO.NET Entity Framework (EF) is an object-relational mapping (ORM) framework for the .NET Framework.

Contents

Overview

ADO.NET Entity Framework abstracts the relational (logical) schema of the data that is stored in a database and presents its conceptual schema to the application. At development time, this abstraction eliminates the object-relational impedance mismatch that is otherwise common in conventional database-oriented programs. The run-time overhead of mapping between the conceptual schema and the underlying relational schema is still a factor to be considered.

For example, in a conventional database-oriented system, entries about a customer and their information can be stored in a Customer table, their orders in an Order table and their contact information in yet another Contact table. The application that deals with this database must "know" which information is in which table, i.e., the relational schema of the data is hard-coded into the application.

The disadvantage of this approach is that if this schema is changed the application is not shielded from the change. Also, the application has to perform SQL joins to traverse the relationships of the data elements in order to find related data. For example, to find the orders of a certain customer, the customer needs to be selected from the Customer table, the Customer table needs to be joined with the Order table, and the joined tables need to be queried for the orders that are linked to the customer.

This model of traversing relationships between items is very different from the model used in object-oriented programming languages, where the relationships of an object's features are exposed as Properties of the object and accessing the property traverses the relationship.

History

The first version of Entity Framework (EFv1) was included with .NET Framework 3.5 Service Pack 1 and Visual Studio 2008 Service Pack 1, released on 11 August 2008. This version has been widely criticized, even attracting a 'vote of no confidence' signed by approximately one thousand developers.[1]

The second version of Entity Framework, named Entity Framework 4.0 (EFv4), was released as part of .NET 4.0 on 12 April 2010 and has addressed many of the criticisms made of version 1.[2]

A third version of Entity Framework, version 4.1, was released on April 12, 2011.

A refresh of version 4.1 named Entity Framework 4.1 Update 1, was released on July 25, 2011. It includes bug fixes and new supported types.

Architecture

The architecture of the ADO.NET Entity Framework, from the bottom up, consists of the following:

Entity Data Model

The Entity data model (EDM) specifies the conceptual model (CSDL) of the data via the Entity-Relationship data model, which deals primarily with Entities and the Associations they participate in. The EDM schema is expressed in the Schema Definition Language (SDL), which is an application of XML. In addition, the mapping (MSL) of the elements of the conceptual schema (CSDL) to the storage schema (SSDL) must also be specified. The mapping specification is also expressed in XML.[3]

Visual Studio also provides Entity Designer, for visual creation of the EDM and the mapping specification. The output of the tool is the XML file (*.edmx) specifying the schema and the mapping. Edmx file contains EF metadata artifacts (CSDL/MSL/SSDL content). These 3 files (csdl, msl, ssdl) can also be created or edited by hand.

Mapping

Entity Data Model Wizard[4] in Visual Studio initially generates a 1:1 (one to one) mapping between the database schema and the conceptual schema in most of the cases. In the relational schema, the elements are composed of the tables, with the primary and foreign keys gluing the related tables together. In contrast, the Entity Types define the conceptual schema of the data.

The entity types are an aggregation of multiple typed fields – each field maps to a certain column in the database - and can contain information from multiple physical tables. The entity types can be related to each other, independent of the relationships in the physical schema. Related entities are also exposed similarly – via a field whose name denotes the relation they are participating in and accessing which, instead of retrieving the value from some column in the database, traverses the relationship and returns the entity (or a collection of entities) it is related with.

Entity Types form the class of objects entities conform to, with the Entities being instances of the entity types. Entities represent individual objects which form a part of the problem being solved by the application and are indexed by a key. For example, converting the physical schema described above, we will have two entity types:

The logical schema and its mapping with the physical schema is represented as an Entity Data Model (EDM), specified as an XML file. ADO.NET Entity Framework uses the EDM to actually perform the mapping letting the application work with the entities, while internally abstracting the use of ADO.NET constructs like DataSet and RecordSet. ADO.NET Entity Framework performs the joins necessary to have entity reference information from multiple tables, or when a relationship is traversed. When an entity is updated, it traces back which table the information came from and issues SQL update statements to update the tables in which some data has been updated. ADO.NET Entity Framework uses eSQL, a derivative of SQL, to perform queries, set-theoretic operations, and updates on entities and their relationships. Queries in eSQL, if required, are then translated to the native SQL flavor of the underlying database.

Entity types and entity sets just form the logical EDM schema, and can be exposed as anything. ADO.NET Entity Framework includes Object Service that presents these entities as Objects with the elements and relationships exposed as properties. Thus Entity objects are just front-end to the instances of the EDM entity types, which lets Object Oriented languages access and use them. Similarly, other front-ends can be created, which expose the entities via web services (e.g., Astoria) or XML which is used when entities are serialized for persistence storage or over-the-wire transfer.[5]

Entities

Entities are instances of EntityTypes; they represent the individual instances of the objects (such as customer, orders) to which the information pertains. The identity of an entity is defined by the entity type it is an instance of; in that sense an entity type defines the class an entity belongs to and also defines what properties an entity will have. Properties describe some aspect of the entity by giving it a name and a type. The properties of an entity type in ADO.NET Entity Framework are fully typed, and are fully compatible with the type system used in a DBMS system, as well as the Common Type System of the .NET Framework. A property can be SimpleType, or ComplexType, and can be multi valued as well. All EntityTypes belong to some namespace, and have an EntityKey property which uniquely identifies each instance of the entity type. The different property types are distinguished as follows:

All entity instances are housed in EntityContainers, which are per-project containers for entities. Each project has one or more named EntityContainers, which can reference entities across multiple namespaces and entity types. Multiple instances of one entity type can be stored in collections called EntitySets. One entity type can have multiple EntitySets.

EDM primitive types (simple types):[6][8]

EDM type CLR type mapping
Edm.Binary Byte[]
Edm.Boolean Boolean
Edm.Byte Byte
Edm.DateTime DateTime
Edm.DateTimeOffset DateTimeOffset
Edm.Decimal Decimal
Edm.Double Double
Edm.Guid Guid
Edm.Int16 Int16
Edm.Int32 Int32
Edm.Int64 Int64
Edm.SByte SByte
Edm.Single Single
Edm.String String
Edm.Time TimeSpan

Relationships

Any two entity types can be related, by either an Association relation or a Containment relation. For example, a shipment is billed to a customer is an association whereas an order contains order details is a containment relation. A containment relation can also be used to model inheritance between entities. The relation between two entity types is specified by a Relationship Type, instances of which, called Relationships, relate entity instances. In future releases, other kinds of relationship types such as Composition, or Identification, may be introduced. Relationship types are characterized by their degree (arity) or the count of entity types they relate and their multiplicity. However, in the initial release of ADO.NET Entity Framework, relationships are limited to a binary (of degree two) bi-directional relationship. Multiplicity defines how many entity instances can be related together. Based on multiplicity, relationships can be either one-to-one, one-to-many, or many-to-many. Relationships between entities are named; the name is called a Role. It defines the purpose of the relationship.

A relationship type can also have an Operation or Action associated with it, which allows some action to be performed on an entity in the event of an action being performed on a related entity. A relationship can be specified to take an Action when some Operation is done on a related entity. For example, on deleting an entity that forms the part of a relation (the OnDelete operation) the actions that can be taken are:[9]

For association relationships, which can have different semantics at either ends, different actions can be specified for either end.

Schema definition language

ADO.NET Entity Framework uses an XML based Data Definition Language called Schema Definition Language (SDL) to define the EDM Schema. The SDL defines the SimpleTypes similar to the CTS primitive types, including String, Int32, Double, Decimal, Guid, and DateTime, among others. An Enumeration, which defines a map of primitive values and names, is also considered a simple type. Enumerations are unsupported in the current version of the framework. ComplexTypes are created from an aggregation of other types. A collection of properties of these types define an Entity Type. This definition can be written in EBNF grammar as:

EntityType  ::= ENTITYTYPE entityTypeName [BASE entityTypeName] 
 [ABSTRACT true|false] KEY propertyName [, propertyName]*   
 {(propertyName PropertyType [PropertyFacet]*) +} 

PropertyType ::= ((PrimitiveType [PrimitiveTypeFacets]*) 
 |  (complexTypeName) | RowType
PropertyFacet ::= ( [NULLABLE true | false] |  
 [DEFAULT defaultVal] | [MULTIPLICITY [ 1|*] ] )   
PropertyTypeFacet ::= MAXLENGTH | PRECISION | SCALE | UNICODE | FIXEDLENGTH | COLLATION 
 | DATETIMEKIND | PRESERVESECONDS
PrimitiveType ::=  BINARY | STRING | BOOLEAN
 | SINGLE | DOUBLE | DECIMAL | GUID
 | BYTE | SBYTE | INT16 | INT32 | INT64
 | DATETIME | DATETIMEOFFSET | TIME )

Facets[10] are used to describe metadata of a property, such as whether it is nullable or has a default value, as also the cardinality of the property, i.e., whether the property is single valued or multi valued. A multiplicity of “1” denotes a single valued property; a “*” means it is a multi-valued property. As an example, an entity can be denoted in SDL as:

 <ComplexType Name = "Addr">
     <Property Name = "Street" Type = "String" Nullable = "false" />
     <Property Name = "City" Type = "String" Nullable = "false" />
     <Property Name = "Country" Type = "String" Nullable = "false" />
     <Property Name = "PostalCode" Type = "Int32" />
 </ComplexType>
 <EntityType Name = "Customer">
     <Key>
          <PropertyRef Name = "Email" />
     </Key>
     <Property Name = "Name" Type = "String" /> 
     <Property Name = "Email" Type = "String" Nullable = "false" />
     <Property Name = "Address" Type = "Addr" />
 </EntityType>

A relationship type is defined as specifying the end points and their multiplicities. For example, a one-to-many relationship between Customer and Orders can be defined as

 <Association Name = "CustomerAndOrders">
     <End Type = "Customer" Multiplicity = "1" />
     <End Type = "Orders" Multiplicity = "*">
         <OnDelete Action = "Cascade"/>
     </End>
 </Association>

Entity Data Model design approaches

Database First

Entity Framework supports several approaches for creating Entity Data Models. Database First approach was historically the first one. It appeared in Entity Framework v1, and its support was implemented in Visual Studio 2008 SP1. This approach considers that an existing (legacy) database is used, or the new database is created first, and then Entity Data Model is generated from this database with Entity Data Model Wizard.[4][11].

All needed model changes in its conceptual (CSDL) and mapping (MSL) part are performed with Entity Data Model Designer.[12]. If the storage part needs changing, the database must be modified first, and then Entity Data Model is updated with Update Model Wizard [13].

Database First approach is supported in Visual Studio 2008/2010 for MS SQL Server only. However, there are third party solutions that provide Database First support in Visual Studio for other database servers: DB2, EffiProz, Firebird, Informix, MySQL, Oracle, PostgreSQL, SQLite, Sybase, and VistaDB.[14]. Besides, there are third party tools that extend or completely replace standard Entity Data Model Wizard, Entity Data Model Designer and Update Model Wizard:[15]

Model First

A new Model First approach was supported in Visual Studio 2010, which was released together with the second Entity Framework version (Entity Framework v4). In Model First approach the development starts from scratch. At first, the conceptual model is created with Entity Data Model Designer, entities and relations are added to the model, but mapping is not created. After this Generate Database Wizard is used to generate storage (SSDL) and mapping (MSL) parts from the conceptual part of the model and save them to the edmx file. Then the wizard generates DDL script for creating database (tables and foreign keys) [16][17][18]

If the model was modified, the Generate Database Wizard should be used again to keep the model and the database consistent. In such case, the generated DDL script contains DROP statements for tables, corresponding to old SSDL from the .edmx file, and CREATE statements for tables, corresponding to new SSDL, generated by the wizard from the conceptual part. In Model First approach developer should not edit storage part or customize mapping, because they will be re-generated each time when Generate Database Wizard is launched.[16][17][19]

Model First in Visual Studio 2010 is supported only for MS SQL Server.[16]. However there are third-party solutions that provide support for Oracle, MySQL, and PostgreSQL.[20] Besides, there are third party tools for complete replacement of Entity Data Model Designer and Generate Database Wizard in the context of Model First approach.[15]

Code First

Code First is the most recent approach included in RC. The model is defined via classes and configuration written by the developer and via conventions included in the framework itself.

Querying data

Entity SQL

ADO.NET Entity Framework uses a variant of the structured query language, named Entity SQL, which is aimed at writing declarative queries and updates over entities and entity relationships – at the conceptual level. It differs from SQL in that it does not have explicit constructs for joins because the EDM is designed to abstract partitioning data across tables. Querying against the conceptual model is facilitated by EntityClient classes, which accepts an Entity SQL query. The query pipeline parses the Entity SQL query into a command tree, segregating the query across multiple tables, which is handed over to the EntityClient provider. Like ADO.NET data providers, an EntityClient provider is also initialized using a Connection object, which in addition to the usual parameters of data store and authentication info, requires the SDL schema and the mapping information. The EntityClient provider in turn then turns the Entity SQL command tree into an SQL query in the native flavor of the database. The execution of the query then returns an Entity SQL ResultSet, which is not limited to a tabular structure, unlike ADO.NET ResultSets.

Entity SQL enhances SQL by adding intrinsic support for:

Entity SQL Canonical Functions

Canonical functions are supported by all Entity Framework compliant data providers. They can be used in an Entity SQL query. Also, most of the extension methods in LINQ to Entities are translated to canonical functions. They are independent of any specific database. When ADO.NET data provider receives a function, it translates it to the desired SQL statement.[21]

But not all DBMSs have equivalent functionality and a set of standard embedded functions. There are also differences in the accuracy of calculations. Therefore, not all canonical functions are supported for all databases, and not all canonical functions return the same results.[22]

Group Canonical functions[21]
Aggregate functions Avg, BigCount, Count, Max, Min, StDev, StDevP, Sum, Var, VarP
Math functions Abs, Ceiling, Floor, Power, Round, Truncate
String functions Concat, Contains, EndsWith, IndexOf, Left, Length, LTrim, Replace, Reverse, Right, RTrim, Substring, StartsWith, ToLower, ToUpper, Trim
Date and Time functions AddMicroseconds, AddMilliseconds, AddSeconds, AddMinutes, AddHours, AddNanoseconds, AddDays, AddYears, CreateDateTime, AddMonths, CreateDateTimeOffset, CreateTime, CurrentDateTime, CurrentDateTimeOffset, CurrentUtcDateTime, Day, DayOfYear, DiffNanoseconds, DiffMilliseconds, DiffMicroseconds, DiffSeconds, DiffMinutes, DiffHours, DiffDays, DiffMonths, DiffYears, GetTotalOffsetMinutes, Hour, Millisecond, Minute, Month, Second, Truncate, Year
Bitwise functions BitWiseAnd, BitWiseNot, BitWiseOr, BitWiseXor
Other functions NewGuid

LINQ to Entities

The LINQ to Entities provider allows LINQ to be used to query various RDBMS data sources. Several database server specific providers with Entity Framework support are available.

Native SQL

In the Entity Framework v4 new methods ExecuteStoreQuery() and ExecuteStoreCommand() were added to the class ObjectContext.

Visualizers

Visual Studio has a feature called Visualizer. A Linq query written in Visual Studio can be viewed as Native SQL using a Visualizer during debug session. A Visualizer for Linq to Entities (Object Query) targeting all RDBMS is available via VisualStudioGallery.

Entity Framework ADO.NET providers

Database Provider name
Microsoft SQL Server Analysis Services (SSAS) SSAS Entity Framework Provider
DB2 IBM Data Server Provider
Caché InterSystems Caché Managed Provider
EffiProz EffiProz ADO.NET Provider
Firebird Firebird ADO.NET Data Provider
Informix IBM Data Server Provider
Microsoft SQL Server SqlClient
MySQL Devart dotConnect for MySQL
MySQL Connector/NET
Oracle DataDirect ADO.NET Data Provider for Oracle
Devart dotConnect for Oracle
PostgreSQL Devart dotConnect for PostgreSQL
Npgsql
SQLite Devart dotConnect for SQLite
System.Data.SQLite
Sybase Sybase iAnywhere
Virtuoso Universal Server Virtuoso ADO.Net Provider
VistaDB VistaDB provider

See also

References

  1. ^ ADO .NET Entity Framework Vote of No Confidence
  2. ^ "Update on the Entity Framework in .NET 4 and Visual Studio 2010". ADO.NET team blog. May 11, 2009. http://blogs.msdn.com/adonet/archive/2009/05/11/update-on-the-entity-framework-in-net-4-and-visual-studio-2010.aspx. Retrieved 2011-11-01. 
  3. ^ CSDL, SSDL, and MSL Specifications, MSDN, http://msdn.microsoft.com/en-us/library/bb399604.aspx, retrieved 2010-12-06 
  4. ^ a b Entity Data Model Wizard, MSDN, http://msdn.microsoft.com/en-us/library/bb399247(VS.100).aspx, retrieved 2010-12-06 
  5. ^ Kogent Solutions Inc. (2009), ASP.NET 3.5 Black Book, Dreamtech Press, ISBN 8177228315 
  6. ^ a b Simple Types (EDM), MSDN, http://msdn.microsoft.com/en-us/library/bb399213(VS.100).aspx, retrieved 2010-12-06 
  7. ^ ComplexType Element (CSDL), MSDN, http://msdn.microsoft.com/en-us/library/cc716799(VS.100).aspx, retrieved 2010-12-06 
  8. ^ Conceptual Model Types, MSDN, http://msdn.microsoft.com/en-us/library/bb399548(VS.100).aspx, retrieved 2010-12-06 
  9. ^ OnDelete Element (CSDL), MSDN, http://msdn.microsoft.com/en-us/library/cc716734(VS.100).aspx, retrieved 2010-12-06 
  10. ^ Facets (CSDL), MSDN, http://msdn.microsoft.com/en-us/library/cc716737(VS.100).aspx, retrieved 2010-12-06 
  11. ^ Creating an Entity Data Model from a Database, MSDN, http://msdn.microsoft.com/en-us/data/ff191186.aspx, retrieved 2010-12-06 
  12. ^ ADO.NET Entity Data Model Designer, MSDN, http://msdn.microsoft.com/en-us/library/cc716685.aspx, retrieved 2010-12-06 
  13. ^ Update Model Wizard (Entity Data Model Tools), MSDN, http://msdn.microsoft.com/en-us/library/cc716705.aspx, retrieved 2010-12-06 
  14. ^ ADO.NET Data Providers for Entity Framework, MSDN, http://msdn.microsoft.com/en-us/data/dd363565.aspx, retrieved 2010-12-06 
  15. ^ a b Section Tools in Beginner's Guide to the ADO.NET Entity Framework, MSDN, http://msdn.microsoft.com/en-us/data/ee712907.aspx, retrieved 2010-12-06 
  16. ^ a b c Generate Database Wizard (Entity Data Model Tools), MSDN, http://msdn.microsoft.com/en-us/library/dd456817.aspx, retrieved 2010-12-06 
  17. ^ a b How to: Generate a Database from a Conceptual Model (Entity Data Model Tools), MSDN, http://msdn.microsoft.com/en-us/library/dd456815.aspx, retrieved 2010-12-06 
  18. ^ Creating an Entity Data Model the Model-First Way, MSDN, http://msdn.microsoft.com/en-us/data/ff628199.aspx, retrieved 2010-12-06 
  19. ^ Database Generation Rules (Generate Database Wizard), MSDN, http://msdn.microsoft.com/en-us/library/dd456825.aspx, retrieved 2010-12-06 
  20. ^ Entity Framework 4 Release Candidate supported, Devart, http://www.devart.com/blogs/dotconnect/?p=2062, retrieved 2010-12-06 
  21. ^ a b Canonical Functions (Entity SQL), MSDN, http://msdn.microsoft.com/en-us/library/bb738626(VS.100).aspx, retrieved 2010-03-29 
  22. ^ Entity Framework Canonical Functions, Devart, 2010-03-04, http://www.devart.com/blogs/dotconnect/?p=1161, retrieved 2010-03-29 

Bibliography

External links